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Abstract 

In this paper, we consider Bayesian hypothesis testing for the balanced one-way random 
effects model. A special choice of the prior formulation for the ratio of variance components 
is shown to yield an explicit closed-form Bayes factor without integral representation. Fur- 
thermore, we study the consistency issue of the resulting Bayes factor under three asymptotic 
scenarios: either the number of units goes to infinity, the number of observations per unit goes 
to infinity, or both go to infinity. Finally, the behavior of the proposed approach is illustrated 
by simulation studies. 
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1 Introduction 

Consider the balanced one-way analysis-of-variance (ANOVA) random effects model 

Vij = IJ. + ai + Eij for i = 1,2,... j = 1,2,... ,r, (1) 

where yij is the jth observation associated with the unit i and represents the unknown intercept. 
Here p (> 2) is the number of units and r (> 2) is the number of observations per unit. It is 
assumed that the random effect Oj and the error term Sij are mutually independent, and that 
~ A^(0,(7^) and Eij ~ A^(0, o"^) for all i and j, where iid represents "independent and identically 
distributed." The unknown parameters <t^ and o"^ are often called variance components in the 
literature. For notational convenience, throughout the paper, let and stand for X^f^^ 
Yl^i=i^ respectively. 
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In such a balanced variance components model ([T]), we are often interested in evaluating whether 
the random effects should be included, which is equivalent to testing 



Mo:al = versus Mi : ^ 0. 



(2) 



For ease of exposition, let Nr{x \ 0, represent the probability density of a multivariate normal 
distribution with an r-dimensional random vector of observations x, an r-dimensional mean vector 
and an r X r variance-covariance matrix ^. Without loss of generality, the model ([T]) can then 
be expressed compactly in matrix form as follows 

p 



f{Y I /X, al) = YlNriYi \ filr, S), (3) 

where 1,. is an r x 1 vector of ones and 



i=l 



by letting Y = (yi,-- - ,yp)' with y^ = {yn,--- ,yir/ , 
S = cr^Ir + cT^Jr with 1^ being an r x r identity matrix and being an r x r matrix containing 
only ones. Accordingly, the hypothesis testing problem ([2]) can be equivalently expressed as testing 
the following two models 

Mo:fi{Y\fi,a^al) = f{Y\fi,a\0) versus A'h : h{Y \ fi,a\al) = f{Y \ fi,a^ ,al). (4) 



It is well known by IBox and Tiad ( 19731 ) that the classical unbiased estimates of cr^ can be 
negative even if the true value of cr^ is nonnegative. This is a serious disadvantage of using these 
estimates in frequentist analysis. To avoid this problem, this paper deals with the problem of 
hypothesis testing or model selection based on the Bayesian approach. As mentioned by one ref- 
eree, an operational advantage of the Bayesian approach is that likelihood-based methods require 
special care since the parameter being te sted is a boundary case, le ading to the f ailure of standard 



asymptotic scenarios; see, for example, 



Mailer and Zhoul (|l996l ^ 



Pauler et al. 



119991) and refer- 



ences therein. In addition, there are many other advantages for using the Bayesian approach to 
this problem over the freq uent ist or classical approach. We here refer the interested reader to 
Westfall and GonenI (jl996l ) and lBerger and Pericchil (j200ll ) for more details. 



19951 ) offers a natural way 



From the Bayesian viewpoint, the Bayes factor (jKass and Raftervl . 
of measuring the evidence in data for various competing models in terms of their posterior model 
probabilities. In our problem, the Bayes factor for comparing Mi to Mq given by @ can be written 
as 



5Fi 



10 



mi(Y) 
mo(Y)' 

2 



(5) 



where 

mi(Y) =p(Y I Ml) = III h{Y I IX, a\al)^ll{^l,a\al)d^, daedal 

and 

mo(Y) =p(Y I Mo) = 11 MY \ fi,a^)7ro{fi,a^) dfida^ 

with 7ri(^, (7^, (7^) and 7ro(/^, cr^) being the joint prior densities for the unknown parameters under 
Ml and Mo, respectively. From Bayes theorem, the posterior probabihty of model Mi given Y can 
be expressed through the Bayes factor as 

p(Mi)mi(Y) _ p{Mi)BFio 

P^^' ' ^' ~ p(A/i)mi(Y)+p(Mo)mo(Y) " p(Mo) + p(Mi)i?Fio ' 

where p(Mj) is the prior probability of model Mj for i = 0,1. In the absence of prior knowledge, it 

is natural to specify p(Mo) = p(Mi) = 1/2. Therefore, for decision-making, the model Mi is more 

likely to be selected if p{Mi \ Y) > 1/2, or equivalently, -BFio > 1. 

In the Bayesian framework, it is of particular interest to study the consistency issue of the 

proposed procedures. Here, consistency means that the true model will be selected if enough data 

are provided, assuming that one of the competing models is true. Let Mj- stand for the true model. 



Fernandez et al 



(|200ll ) formally defined the posterior consistency for hypothesis testing or model 
selection as 

plimp(Mr \Y) = l, (7) 

n—^oo 

where 'plim' denotes convergence in probability as n, the total number of observations, goes to infin- 
ity. Due to the relationship between the posterior probability and the Bayes factor, the expression 
([7]) for our testing problem ([2]) becomes 



plim5Fio = oo, (8) 

if Ml is the true model, whereas 

plim BFio = 0, (9) 

n— >oo 

if Mq is the true model. Since n = pr in this paper, we shall mainly focus on the consistency 
of Bayes factor for the hypothesis testing problem in the balanced one-way random effects model 
under three asymptotic scenarios as follows: 
Scenario 1 r goes to infinity, but p > 2 is fixed. 
Scenario 2 p goes to infinity, but r > 2 is fixed. 
Scenario 3 both r and p go to infinity. 



For the hypothesis testing problem in the balanced random effects model, numerous Bayesian 



Westfall and GonenI (119961 1 



approaches have recently been proposed in the literature. For example, 
proposed a new Bayes factor for hypothesis testing in the one-way ANOVA model with either fixed 
or random effects and then studied the B a yes fa ctor consistency under the first two asymptotic 



scenarios. Later, 



Garcia-Donato and SunI (j2007l ) developed the divergence-based prior and the 



intrinsic prior for th e parameter cr^ an d also showed that both priors produce consistent Bayes 



factors. In addition. 



Cano et al 



integral priors introduced by 



(120071) derive d a new Bayes factor based on the methodology of 



Cano et al. 



(|2008l ) . An attractive feature of the integral priors is that 
they take advantage of Markov chain Monte Carlo (MCMC) techniques to produce unique Bayes 
factors often, whereas it is unclear whether or not the integral priors for this testing problem are 
unique from the theoretical viewpoint. Additionally, they do not further investigate the consistency 
issue of the resulting Bayes factor. 

It is worthwhile mentioning that the approaches mentioned above have been shown to perform 
well in a variety of real applications. In most cases, the integral representation is involved in the 
expression of Bayes factors, so numerical approximations will generally be employed. However, 
it is not an easy task in applied statistics to decide which type of approximations to be more 
appropriate, especially when both p and r are extremely large. Moreover, they do not seem to take 
the asymptotic property of the proposed testing procedures into account under Scenario 3, which 
would also be of interest to readers and researchers. 

In this paper, we propose an explicit closed-form Bayes factor without integral representation 
for the balanced one-way random effects model. Of particu l ar no te is that the proposed Bayes 



factor is exactly the same as the one derived by 



Maruvamal (j2009l ) for the balanced fixed effects 



model. In addition, we study the consistency issue of Bayes factor under the three asymptotic 
scenarios mentioned above. It is shown that the resulting Bayes factor is always consistent under 
Mq, but it may be inconsistent under Mi in Scenario 2 due to the presence of a small inconsistency 
region, which can be characterized by the number of observations per unit. 

One may argue that making the distinction between fixed effects and random effects is obscure 
from the Bayesian viewpoint because all parameters could be viewed as random variables. Never- 
theless, as mentioned by one referee, one main difference between the two models is that for the 
random effects model, the dimension of the parameter space under the full model is three, namely 
(/U,(7a,cr), which does not grow when either the number of observations approaches infinity or the 
number of units approaches infinity, whereas for the fixed effects model, the model dimension grows 



4 



as the number of units increases. In add ition, there are two main differences between the present 
paper and the study of Maruvamal ( 2OO9I ) for the fixed effects model. 



(i) In the random effects model, under orthogonality and same magnitudes, one can easily justify 
the reasonability of using the same ( even noninformative) prior for the common parameters ^ 



and a^. We here refer the reader to 



Garcia-Donato and SunI (|2007l ) for more details. Specifi 



cally, we consider the consistency property of Bayes factor under a scenario in which both r 
and p approach infinity. 

(ii) From the Bayesian standpoint, both fixed effects model and random effects model can be 



SmithI (|l973l ). " for the Bayesian 



treated as three-stage hierarchical models. As stated by 
model the distinction between fixed, random and mixed models reduces to the distinction be- 
tween different prior assignments in the second and third stages of the hierarchy." For a 



RendonI (|2002l ). It is noteworthy that 



detailed discussion on the topic, one may also refer to 
the prior fo rmulations for the unknown parameters in this paper are different from the priors 
adopted by iMaruvamal (j2009l ). 



The remainder of this paper is organized as follows. In Section^ we discuss the prior formulation 
for the unknown parameters fi, cj^ and cr^, and then adopt a specific prior distribution for the ratio 
of variance components a'^/a'^, which results in an explicit closed- form Bayes factor without integral 
representation. In Section[3l we investigate the corresponding consistency of Bayes factor under the 
three different asymptotic scenarios listed above. The performance of the proposed Bayes factor is 
illustrated through several simulated studies in Section [H Some concluding remarks are given in 
Section [5l Finally, several useful lemmas and proofs will be provided in Appendix. 



2 Bayes factor 



Direct use of improper priors is unsuitable for the hypothesis testing problem because it may yield 
a Bayes factor up to sorn e undetermined normalizing constants. Intrinsic priors, developed by 
Berger and Pericchil (|l996l ). have been widely used to overcome this potential difficulty on the use 
of improper priors. The idea of intrinsic priors is to convert improper priors into ones suitable 
for comp uting the Bayes factors. We do not revi e w them he r e, bu t rather point the intereste d 



reader to 



Berger and Pericchi J 199 



Giron et al 



20061 ^ 



Casella et al 



Moreno et al 



2OO9I ). 



( 199 



Torres et al 



20031 . 



2011 



20081 ). ICaseha and Morenol pood l 



), among others. 
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An alternative way to avoid such a pitfall of the Bayes factor when using improper priors is to 
choose the same improper prior for "common parameters" that appear in the two competing models, 
although it could be argued that the comm on parameters may chan g e mea nings from one model 



to another. Nevertheless, as mentioned by 



Kass and Vaidvanathan 



(jl992l ). under orthogonality 



(i.e., the expected Fisher information matrix is diagonal), the Bayes factor is quite robust to the 
selection of the same (even noninformati ye) prior adopted for the common orthogonal parameters. 



For the hypothesis testing problem (jU, iGarcia-Donato and SunI (|2007l ) showed that the common 
parameters fi and cj^ are approximately (for a moderate or large value of n) orthogonal to the new 



parameter aa in M-\ . As a resu 



both Mo and Mi (jjeffrevs , 



1961 



t, both fj, and o"^ may be assumed to have the same meanings in 
, Chapter 5), justifying the use of the same noninformative priors. 



Accordingly, under Mq, we can adopt the following prior for fi and cj^ given by 



where c is a constant, and under Mi, we specify 

7ri(//,cr^cj^) = 7ro(//,cr^)7r*(cr^ | a^), 
where a scale family prior for cr^ is adopted and given by 



1 / a, 



(10) 



(11) 



(12) 



with 7r(cj^/cr^) being the prior distribution for the ratio of variance components a'^/a'^, which will 
be specified later. 

Note that the idea of using same noninformative priors for common (orthogonal) parameters 



has b ee n proved to be successful by 



(120071), iBavarri and Garcfa-Donatol (|2008l l 



many statisticians: see, f or example. 



Garcia-Donato and Sun 



Liang et al. 



(j2008l ). to name just a few. To avoid the 



undefined Bayes factors, a proper prior distribution i s often required for the ratio of variance 
components r = ct^/o"^. According to Proposition 1 of iGarcia-Donato and SunI ( 20071 ). the Bayes 
factor given by ([5]) for the priors (jlOp and (jlip along with ()12p can be written as 



BF, 
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(l+rr)-(^-^)/2fl-^^ 



-(n-l)/2 



7r(r) dr, 



(13) 



where Wh and Wt stand for the sum of squares between groups and the total sum of squares, 
respectively, and they are given by 



y- 



and Wt = ^^iy 



ij - y- 



with y.. = J2iJ2j Uij/^ Ui. = "^jUij/r. Various choices of the pri or distribution 7r(r) f or the 



ratio of r have recently been proposed in the hterature. For example, 
advocated the following prior distribution 



Westfall and GonenI (jl996l ) 



TT 



'^«(r) = (l + r)-%,^)(r) 



(14) 



which is also named the hyper-g' prior in 



Liang et al 



(|2008l ). Furthermore, 



Westfall and Gonen 



(jl996l ) showed the consistency of B ayes factor with the choice of p rior p4p for r under the first two 



asymptotic scenarios above. Later, 



Garcia-Donato and SunI (j2007l ) proposed the intrinsic prior and 



the divergence-based prior for r and then investigated the corresponding consistency of Bayes factor 
under the two priors, respectively. As s uggested by one referee, it d e serves to mention here that 

t^ 



the divergence-based prior, developed by 



Bayarri and Garcia-Donatd (|2007l ). is a density function 



proportional to a positive meas ure of divergence between two co mpeting models raised to a negative 



power q. One may also refer to 



Garcia-Donato and Sun 



tooi ) in detail. 



In this paper, we adopt a new prior density for r, often called the Pearson type VI distribution 
with shape parameters a>— 1,/3>— 1 and scale parameter k > 0. The density function of this 
distribution is given by 



KT] 



-a-13-2 



B{a + l,l3 + l) 



'{0,00 



(15) 



where B{-,-) is the beta function. Note that the beta-prime distribution used by iMaruyamal (120091 ) 
is just a special case of the Pearson type VI distribution with k = 1, and that tt^^^t) in ()14p is 
also a special case with k = 1 and a = /3 = 0. To obtain an explicit closed-form Bayes factor, cases 
for which k = r will be of interest to us in what follows. With the use of transformation t = rr, 
simple algebra shows that the Bayes factor in (jl3p with '7t{t) replaced by 'k^'^{t) in (jl5p becomes 



BF^ 
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(n-p)/2-Q-/3-2 / ^ _^ We ^ 



5(a + l,/3 + l) 7o " ' ^' V ' Wt 

where We represents the sum of squares within groups and is given by 



-(n-l)/2 



dt, 



(16) 



WE = WT-WH = ^^iy: 



ij Di 



Observe that the Bayes fact or in (1161) can be ha ndled using a one-dimensional integral. The Laplace 



approximation approach in 



Liang et al 



(|2008l ) may also be employed to evaluate the integral over 
the entire real line. Nevertheless, it seems difficult to choose the appropriate types of the ap- 
proximations that we should employ in practice and to assess the quality of these approximations. 
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especially when both p and r are extremely large. Of particular note here is that with the use of the 
above prior distributions, we can derive an analytical closed-form Bayes factor with an appropriate 
choice of /3 in the following theorem. The proof is straightforward and is thus omitted for brevity. 

Theorem 1 With the priors given by ilO\) under Mq and by ill]) . (|12p and Iil5\) with k = r and 
/3 = {n — p)/2 — a — 2 under Mi, the Bayes factor given by il6\) turns out to be 

r(p/2 + g + l/2)r((n - p)/2) ^ 

"""^^^ - r((n-i)/2)r(a + i) [w^J ■ ^''^ 

Notice that the Bayes factor in ()17p has an explicit closed-form expression without integral 
representation, which can easily be calculated by using standard statistical software such as Matlab 
or R, and is readily accessible to non-statisticians in real applications. In other words, the Pearson 
type VI prior for the ratio of variance components provides a simple way of avoiding complex 
computational difficulties in the case where evaluation of the Bayes factor includes solving integrals. 
It is worth noting that the expression of Bayes factor given by ()17p exactly coincides with the 



one m 



Maruvama 



((20091) for the balanced fixed effects model. Such an expression agreement is a 
consequence of the special choice k = r and /3 = {n—p)/2 — a — 2 in the Pearson type VI distribution 
and may be unavailable for other choices of k and /3, even if both fixed and random effects can be 
treated as random variables from the Bayesian viewpoint. 

At this stage, the hyperparameter a in the expression of Bayes factor (I17p has not yet been 
assigned. It is well known that in Bayesian statistical analysis, choosing the hyperparameters of the 
prior distribution has a large impact on the behavior of Bayes factor. In this paper, we recommend 
— 1/2 < a < 0. It has been shown in the simulation studies that the proposed Bayes factor is quite 
robust to the choice of a G [—1/2, 0]. Note that the prior 7r^'^(r) with /3 = (n —p)/2 — a — 2 



depends on the samp 
example, iMaruyamal i 



e size n; this kind of prio r has also been adopted by ma ny authors; see, for 



20091) 



Liang et al. 



Maruyama and George 



2011 



), to mention just a 



20081 ). 

few. As the sample size grows, the prior 7r^"^(r) has a density in the right tail that behaves like 
^-{a+2) ^ leading to a very fat tail for small value of a. Furthermore, it can be seen from Figured] 
that its mode also tend s to and thus th i s prio r puts more weight to small values of r, an attractive 



property considered by 



Gustafson et al. 



()2006l ). It should be mentioned that other optimal choices 
of these hyperparameters such as the one based on the empirical Bayes criterion can be further 
explored in future work. 
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Probability density function of n (t) 




Figure 1: The prior 7r^-^(r) in (jl5|) with k = r, a = —1/4 and /3 = {n — p)/2 — a — 2 for the different 
choices of (r, p) 

3 Model selection consistency 

From the Bayesian theoretical point of view, it is of particular interest to investigate the asymp- 
totic behavior of Bayes factors such as consistency when the sample size approaches infinity. As 
mentioned in Section [H consistency means that the true model will be selected if enough data 
are provided, as s uming that one of the competing models is true. This is formally intr o duced by 



Fernandez et al 



(2001 



Liang et al 



([2003) 



and later is called "model selection consistency" by 
In this section, we mainly focus on the consistency properties of the resulting Bayes factor in 
(jl7p for hypothesis testing in the sense that when the sample size approaches infinity, the Bayes 
factor goes to infinity when the alternative model Mi is true, while it goes to when the null model 
Mq is true. We are now in a position to summarize the Bayes factor consistency under the three 
asymptotic scenarios described in Section [1] as follows. 

Theorem 2 Consider the balanced one-way random effects ANOVA model (CP and the Bayes factor 
for testing Mq against Mi in ^ with the priors given by 110\) . and 112\) as well as U5\) when 
K = r and (3 = p{r — l)/2 — a — 2. 



(a) Under Scenario 1, if r goes to infinity, but p > 2 is fixed, then the Bayes factor in [11) is 
consistent whichever model is true. 
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(h) Under Scenario 2, if p goes to infinity, hut r > 2 is fixed, then the Bayes factor in (11) is 
consistent under Mq and under Mi when cr'^/cr'^ > h{r), while the Bayes factor is inconsistent 
when cr^/o"^ < h{r), where 

h{r) = ri/(^-i) - 1. (18) 



(c) Under Scenario 3, if both p and r go to infinity, then the Bayes factor in (11) is consistent 
whichever model is true. 



Proof. See Appendix for the proof. 



□ 



1.5 



0.5 




■h(r) 



60 



100 



120 



140 



Figure 2: The function h(r) in (llSp used to determine the inconsistency region 

The above theorem has estabhshed the consistency properties of the proposed Bayes factor ()17p . 
which states convergence in probabihty of the true model asymptoticahy being chosen when the 
sample size approaches infinity under the three different asymptotic scenarios listed above. It should 
be noted that under Scenario 2 there exists an inconsistency region (the shaded area in Figure [2]) 
located in a small neighborhood of the null model. This inconsistency region can be characterized 
by the function h{r) in (jl8|) . a decreasing convex function in r, satisfy i ng lim r_j.oo h(r) = 0. See 



Maruvama 



((20091) recently developed 



Figure [2j Under the balanced fixed effects ANOVA model, 
a new closed- form Bayes factor for testing whether the fixed effects are jointly significant and also 
derived a similar inconsistency region given by 



lim 

p^-oo pa^ 



> h{r). 
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Furthermore, iMaruvamal (|2009l ) justified that the existence of the inconsistency region is quite 
reasonable under Scenario 2 from the predictive Bayesian viewpoint. It is of interest to note here 
that the two inconsistency regions are quite similar, which indicates that the variance of the random 
effects (T^ in the random effects model plays a similar role as the limit of af /p in the fixed effects 



model. 



4 Numerical results 

In this section, we aim to numerically illustrate the finite sample performance of the Bayes factor in 
(jl3p for the Pearson type VI prior with various choices of the hyperparameters (a, /?, k) through 
some simulation studies. For this end, we describe how the data sets in the balanced one-way 
random effects model ([1]) are generated. Under Mq, the samples are simulated with = 0, o"^ = 1 
and cj^ = 0, and under Mi, the samples are simulated with /i = 0, o"^ = 1 and cr^ is taken to be 
one of the five different values {0.5, 1, 2, 3, 5}. For each case, we generate data sets with various 
values of p and r in order to mimic the three different kinds of asymptotic scenarios. We analyze 
10, 000 simulated data sets for each case with various choices of p and r. The decision criterion 
used in the simulation study is to select Mi if the Bayes factor BFiq > 1 and Mq otherwise. 

We firstly consider the performance of Bayes factor (I17p summarized in Theorem [2] when k = r, 
j3 = {n-p)/2 - a - 2 and values of a G [-1/2, 0] used are -1/2, - 1/4, - 1/5 and -1/10. The 
relative frequency of choosing the true model under the three different scenarios is shown in Tables 
[11 [2] and O respectively. Rather than providing exhaustive results based on these simulations, 
we merely highlight the most important findings from the first three tables here, (i) It can be 
concluded that the Bayes factor in (I17p is fully consistent under the null hypothesis because the 
relative frequency of choosing the null model is consistently closer to 1 as the sample size becomes 
large, (ii) From the fourth column of Table [2] associated with o"^ = 0.5, it can be seen that the Bayes 
factor approaches as the sample size increases. This phenomenon indicates that even though the 
model Ml is true, the Bayes factor still chooses the null model with probability 1 and thus fails 
to be asymptotically consistent. Such a conclusion exactly matches the statements in part (b) of 
Theorem [21 because in the simulation setup we have c^/o"^ = 0.5 < h[2) = 1 when r = 2. Similar 
conclusions can also be reached for cr^ = 1, but the Bayes factor converges to much slowly. The 
Bayes factor is fully consistent for cr^ = 2, 3 and 5 when the sample size becomes large because 
of (y\lo'^ > h(2). (iii) As one would expect, the Bayes factors are fairly robust to the choice of 
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the hyperparameter a G [—1/2, 0] because similar results are obtained for the various values of a 
shown in the Tables [TJ [2] and El In conclusion, the simulation results clearly support the consistency 
claims made in Theorem[2j Furthermore, it is also noteworthy that the performance of the proposed 
Bayes factor is quite satisfactory even for the moderate values of p and/or r. 

We also investigate the performance of Bayes factor in (|13|) under the Pearson type VI prior 
for various choices of the hyperparameters (a, /3, k), as suggested by one referee. Four different 
choices of these parameters are considered in the following simulation study. C h oosiii g (a, /3, k) = 



Westfall and Gonen 



(—1/2, 0, 1) results in the WG prior ([H]) considered by 
of (a, /3, k) = (—1/2, 0, 1/n) yields the hyper-^i/n prior studied by 
hyperparameter g in Zellner's q-prior. The choice of (a, /3, k) = ( 0, 



Liang et al 



1996ll. T 



le choice 



(120081 1 for the 



prior suggested by 



Box and Tiad (jl973l ) and iBerger and Deelvl ()l988l ). and finally, (a, /?, k 



1/2, 1) leads to the 



(0, 0, r) corresponds to the prior derived by choosing a uniform prior on [0, 1] for the parameter 
rr/(l + rr). It should be mentioned that the numerical integration techniques have been employed 
to approximate the Bayes factors under these four different choices of (a, /3, k) because their 
expressions are not analytically tractable. 

Tables |H [5] and [6] summarize the results based on the above simulation setup. The following 
findings can be drawn from these simulation studies, (i) The Bayes factor with the four choices of 
(a, /3, k) generates compatible results in most cases, except for the case (a, /3, k) = (—1/2, 0, 1/n), 
which can be viewed as the most conservative criterion, having associated the smallest relative fre- 
quency of rejecting wrongly Mq. (ii) Of particular note is that there is no inconsistency region 
under Scenario 2, and thus we may conclude that under the three asymptotic scenarios, the Bayes 
factor in (|13p with these four different choices of (a, /3, k) is fully consistent whichever the true 
model is. (iii) As the sample size becomes large, the relative frequency of choosing the true model 
significantly increases and gets closer to each other under the different proposals; a similar conclu- 
sion can also be made when the variance a"^ becomes large. Consequently, we may conclude that 
the Bayes factor in (jl3p under the different priors for r behaves very similarly. 

As mentioned by a referee, it is noteworthy that the numerical results presented above only illus- 
trated the finite sample performance of the proposed Bayes factor for the different hyperparameter 
values. We make no claim about the convergence rate of the Bayes factor from our simulation 
study. The convergence rate of the proposed Bayes factor under the different asymptotic scenarios 
also deserves further exploration. 

In summary, one appealing advantage of the proposed Bayes factor in (]17p is its explicit closed- 
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form expression without integral representation. It has been observed from Table [T] to Table [6] that 
the behavior of the proposed Bayes factor is compatible with the one for other various choices of 
the hyperparameters (a, /3, k), except for the case under Scenario 2 due to the presence of a small 
inconsistency region around the null model. Note that such an inconsistency region may be avoided 
with some other specific choices of the hyperparameters shown in the simulation study. However, 
a drawback of these choices is that the Bayes factor may not be analytically available. 



5 Concluding remarks 

In this paper, we have developed an explicit closed-form Bayes factor without integral representa- 
tion, which can be easily calculated and is readily applicable to the problem of hypothesis testing 
under the balanced one-way random effects model. It is shown that under the different asymptotic 
scenarios described in Section [H the Bayes factor (jl7p is fully consistent under the null model, and 
is also consistent under the alternative model except for a small inconsistency region around the 
null model characterized by equation (jlSp. Such the inconsistency region is the price we have to 
pay for deriving the closed-form of the Bayes factor, as mentioned by one referee. In addition, the 
referee also presumes that the inconsistency region may disappear when the prior is independent 
of the sample size. Such the presumption is quite understandable, but further investigation is 
needed, especially when the number of observation per unit goes to infinity. Looking at the sim- 
ulation studies above, it seems that the inconsistency region may also disappear with some other 
specific choices of the hyperparameters (a, /3, k) in the Pearson type VI distribution, whereas the 
corresponding theoretical properties of Bayes factors with those specific choices under the three 
asymptotic scenarios are currently under investigation and will be reported elsewhere. 



In some pract ical situations, unbala nced data may occur because of physical 



cost constraints. 



imitations and / o r 



Maruvamal (j2009l ) 



Wang and SunI (j2012l ) have recently generalized the results of 
for the balanced fixed effects model to the ones for the unbalanced fixed effects model. In an 
ongoing project, it will be interesting for researchers to study the consistency of Bayes factor under 
the Pearson type VI prior for the unbalanced random effects model. 



Acknowledgements. The authors are very grateful to the Editor and anonymous referees for 
their constructive comments and suggestions that have substantially improved the appearance of 
this paper. 
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6 Appendix 



Before proving Theorem [21 we first provide three useful lemmas. The proo fs of these lemmas are 



straightforward based on several lemmas in 
here for simplicity. 



Garcia-Donato and SunI (|2007l ) and are thus omitted 



Lemma 1 Under Scenario 1, for any fixed p > 2, when the model Mq is true, 

pWh 



plim 

r— 5>oo V 



exp 



2(p-l) 



and when the model Mi is true. 



y We a ^ aid ^ ^ 
phm — = 1 + - 



(19) 



(20) 



where ci = liuir -^oo Wh /{a"^ + ra'^), which follows a chi-square distribution with p — 1 degrees of 
freedom and is hence a distribution free of r. 



(WHfn-p 
lim ' 

p— >oo 



plim i -777- 1 T 



if the model Mq is true. 



Lemma 2 Under Scenario 2; for any fixed r > 2, 

1, 

1 + ra'l/a'^, if the model Mi is true. 
Lemma 3 Under Scenario 3: both r and p goes to infinity, 

1, if the model Mq is true. 



(21) 



r-foo [ Wt Vt — 1 

p— >oo 



(22) 



if the model Mi is true. 



Proof of Theorem [2} 

It is well known that when x is sufficiently large, the Stirling's approximation to the gamma 
function is given by 

r(7ix + 72) « ^/2^e-^i"(7ix)^i"+^2-i/2, 

where f{x) ~ g{x) means the limit of f{x)/g{x) is one as x approaches infinity, 
(a) By using Lemma [H it is easy to show that, under Mq, 

-{n~p~2)/2+a 



^ T{p/2 + g + l/2)r((n - p)/2) (WeX 
'° r((n-l)/2)r(a + l) \Wt) 

r(p/2 + a + 1/2) ^n^^-(p-l)/2 ( pWn 

fx^n^ I2J '''PV2(^ 
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which obviously goes to under Mo as r goes to infinity because the distribution of Wh under Mq 
is free of r. Under Mi, 

.2^, \ (n-p-2)/2-a 



rp/2 



"""^^^ r(a + i) [2) I' + ^J 

^ r(p/2 + a + l/2) /P\^(P"^)/^ fa,i)/2A , ^ A 

r(a + i) V2y' V %-i V 

which goes to infinity under Mi for cr^ > as r goes to infinity. 

(b) Similarly, by using Lemma [21 it is easy to show that, under Mq, 

which goes to under Mq as p goes to infinity. Here 02(0, r) is a constant independent of p and is 
given by 

, _ \/27rr 

'''^"'''^ " r(a + l/2)(r- 1)1/2- 

Under Mi, it is easy to see that 

/pX.+l/2/^^y(-l)/2/ ^(l+m2/^2)(^_l)yn-p-2)/2-. 

« C2(a,r) 



2/ \^rr/(r-i)y p(r-l) 

pxQ+l/2 , / 1 _L ^^2 /^2 ^ _ 1 \ (n-p-2)/2-a 



r/2(^ _ i)p(r-i)/2 (-^ ^ l + rajg p-l \ 
\ r — 1 P / 

.PN.+1/2/ .-1 y+7 (i+ag/aY-^ y/Y (i+mg/g^) y-"^ 

l+"/n ^^2/^2^r~lXP/2 /_(l+^^2/^2)(^_ 1) 



,/px«+i/2/ r-1 v^V(i + a>^r"iy/^ /- 



2r(l + a2/a2) 



> 1, 



which goes to infinity under Mi when 

r 

indicating that cr^/cr^ > r^/^''"!) - 1 = h{r). 
(c) Using Lemma [3l under Mq, 

.X .4^1 /? / 1 \ /2 / l^(n-p-l)/2^r- - l\-(«-P-2)/2+a 

SFio«C3(a)?i"+i/2(^-j (1--' 
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which clearly goes to under Mq as both r and p go to infinity, where 

( \ - 

Furthermore, when Mi is true, 



r/ v r/ \ 



/ 1\a+l/2 /I 4- n-2 /n-2\ ("-P)/2-(l+«) 

which approaches infinity when (1 + a'^/a'^)/r^^^'^~^^ > 1. Namely, cr^ > when both r and p go to 
infinity. This completes the proof. 
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(p, r) 


a 





0.5 


1 2 


3 


5 



-1/2 0.894 0.358 0.476 0.594 0.662 0.733 



-1/4 0.860 0.400 0.521 0.630 0.694 0.759 

(2, 5) 

-1/5 0.855 0.408 0.528 0.637 0.700 0.762 

-1/10 0.845 0.421 0.540 0.647 0.707 0.770 



-1/2 0.937 0.436 0.562 0.677 0.729 0.790 

-1/4 0.914 0.470 0.593 0.700 0.751 0.807 

(2, 10) 

-1/5 0.910 0.475 0.597 0.704 0.755 0.810 

-1/10 0.901 0.483 0.606 0.711 0.766 0.816 



-1/2 0.973 0.660 0.753 0.824 0.851 0.883 

-1/4 0.964 0.678 0.765 0.833 0.859 0.887 

(2, 50) 

-1/5 0.962 0.680 0.767 0.834 0.860 0.888 

-1/10 0.959 0.686 0.770 0.836 0.862 0.890 



-1/2 0.984 0.734 0.812 0.866 0.891 0.914 

-1/4 0.979 0.747 0.820 0.871 0.897 0.917 

(2, 100) 

-1/5 0.978 0.748 0.821 0.872 0.897 0.918 

-1/10 0.976 0.752 0.823 0.874 0.899 0.919 



-1/2 0.993 0.865 0.901 0.933 0.944 0.954 

-1/4 0.991 0.870 0.905 0.936 0.945 0.955 

(2, 500) 

-1/5 0.991 0.871 0.905 0.936 0.946 0.956 

-1/10 0.990 0.872 0.906 0.937 0.956 0.967 



Table 1: The relative frequency of choosing the true model under the Bayes factor in (jl7p with 
four different values of the hyperparameter a for a fixed value of p and increasing values of r in the 
10, 000 simulations. 
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{p, 


r) 


a 





0.5 


1 


2 


3 


5 






-1/2 


0.872 


0.329 


0.486 


0.687 


0.788 


0.887 






-1/4 


0.825 


0.405 


0.570 


0.752 


0.838 


0.918 


(5, 


2) 














-1/5 


0.816 


0.425 


0.586 


0.762 


0.847 


0.922 






-1/10 


0.800 


0.446 


0.612 


0.780 


0.861 


0.930 



(10, 2) 



-1/2 
-1/4 
-1/5 
-1/10 



0.950 
0.923 
0.916 
0.908 



0.266 
0.338 
0.350 
0.378 



0.495 0.768 

0.576 0.823 

0.589 0.832 

0.615 0.848 



0.891 
0.923 
0.928 
0.936 



0.966 
0.979 
0.981 
0.983 





-1/2 


1.000 


0.080 


0.500 


0.961 


0.998 


1.000 




-1/4 


1.000 


0.108 


0.561 


0.973 


0.999 


1.000 


(50, 2) 














-1/5 


1.000 


0.112 


0.572 


0.979 


0.999 


1.000 




-1/10 


1.000 


0.123 


0.592 


0.979 


1.000 


1.000 



(100, 2) 



-1/2 
-1/4 
-1/5 
-1/10 



1.000 
1.000 
1.000 
1.000 



0.025 
0.033 
0.035 
0.039 



0.500 0.996 

0.559 0.998 

0.569 0.998 

0.588 0.998 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



(500, 2) 



-1/2 
-1/4 
-1/5 
-1/10 



1.000 
1.000 
1.000 
1.000 



0.000 
0.000 
0.000 
0.000 



0.494 1.000 

0.528 1.000 

0.534 1.000 

0.544 1.000 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



Table 2: The relative frequency of choosing the true model under the Bayes factor in (jl7p with 
four different values of the hyperparameter a for a fixed value of r and increasing values of p in the 
10, 000 simulations. 



18 





r) 


a 





0.5 


1 


2 


3 


5 






-1/2 


0.764 


0.348 


0.423 


0.520 


0.580 


0.654 






-1/4 


0.714 


0.407 


0.485 


0.533 


0.634 


0.701 


(2, 


2) 














-1/5 


0.705 


0.417 


0.496 


0.587 


0.642 


0.708 






-1/10 


0.690 


0.434 


0.511 


0.600 


0.657 


0.721 



(10, 5) 



-1/2 
-1/4 
-1/5 
-1/10 



0.996 
0.994 
0.993 
0.993 



0.465 
0.519 
0.527 
0.542 



0.812 
0.841 
0.845 
0.855 



0.967 
0.973 
0.975 
0.977 



0.990 
0.993 
0.993 
0.993 



0.998 
0.998 
0.998 
0.999 



-1/2 


0.992 


0.630 


0.836 


0.942 


0.972 


0.989 


-1/4 


0.987 


0.668 


0.858 


0.950 


0.976 


0.990 


(5, 10) 














-1/5 


0.986 


0.675 


0.861 


0.951 


0.977 


0.991 


-1/10 


0.984 


0.686 


0.867 


0.954 


0.978 


0.991 



(10, 10) 



-1/2 
-1/4 
-1/5 
-1/10 



1.000 
1.000 
1.000 
1.000 



0.748 
0.778 
0.783 
0.793 



0.953 
0.960 
0.962 
0.964 



0.995 
0.997 
0.997 
0.997 



0.991 
0.993 
0.993 
0.993 



1.000 
1.000 
1.000 
1.000 



-1/2 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


-1/4 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


(50, 25) 














-1/5 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


-1/10 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 



(25, 50) 



-1/2 
-1/4 
-1/5 
-1/10 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



1.000 
1.000 
1.000 
1.000 



Table 3: The relative frequency of choosing the true model under the Bayes factor in (jl7p with four 
different values of the hyperparameter a for increasing values of p and r in the 10, 000 simulations. 
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(v. 


r) 


OL 


/3 


1 





0.5 


1 


2 


3 


5 






-1/2 





1 


0.853 


0.411 


0.530 


0.643 


0.702 


0.764 


(2, 


5) 


-1/2 





-1/2 


l/n 
1 


0.951 
0.899 


0.250 
0.350 


0.370 
0.469 


0.505 
0.586 


0.580 
0.656 


0.662 
0.729 












r 


0.797 


0.479 


0.585 


0.690 


0.741 


0.797 






-1/2 





1 


0.893 


0.492 


0.617 


0.718 


0.768 


0.821 


(2, 


10) 


-1/2 





-1/2 


l/n 
1 


0.979 
0.932 


0.323 
0.444 


0.460 
0.570 


0.592 
0.681 


0.662 
0.733 


0.734 
0.795 












r 


0.813 


0.582 


0.683 


0.769 


0.811 


0.855 






-1/2 





1 


0.945 


0.707 


0.787 


0.847 


0.871 


0.897 


(2, 


50) 


-1/2 





-1/2 


l/n 
1 


0.996 
0.964 


0.567 
0.677 


0.680 
0.765 


0.766 
0.832 


0.809 
0.858 


0.849 
0.887 












r 


0.819 


0.793 


0.852 


0.896 


0.901 


0.927 





-1/2 





1 


0.964 


0.767 


0.834 


0.884 


0.906 


0.925 




-1/2 





l/n 


0.999 


0.655 


0.751 


0.823 


0.856 


0.887 


(2, 100) 


-1/2 

















1 


0.978 


0.749 


0.821 


0.872 


0.898 


0.918 










r 


0.830 


0.847 


0.889 


0.926 


0.938 


0.947 




-1/2 





1 


0.993 


1.000 


1.000 


1.000 


1.000 


1.000 




-1/2 





l/n 


0.991 


1.000 


1.000 


1.000 


1.000 


1.000 


(2, 500) 


-1/2 



















1 


0.991 


1.000 


1.000 


1.000 


1.000 


1.000 










r 


0.990 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 4: The relative frequency of choosing the true model under the Bayes factor in (jl3p for the 
Pearson type VI prior with different values of the hyperparameters {a, (3, k) for a fixed value of p 
and increasing values of r in the 10, 000 simulations. 
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(v. t) 


OL 


/3 


1 





0.5 


1 


2 


3 


5 




-1/2 





1 


0.801 


0.445 


0.611 


0.780 


0.860 


0.930 


(5, 2) 


-1/2 





-1/2 


l/n 
1 


0.945 
0.870 


0.171 
0.332 


0.293 
0.491 


0.489 
0.692 


0.627 
0.791 


0.776 
0.889 










r 


0.763 


0.499 


0.659 


0.812 


0.886 


0.943 




-1/2 





1 


0.850 


0.497 


0.722 


0.908 


0.963 


0.991 


(10, 2) 


-1/2 





-1/2 


l/n 
1 


0.982 
0.908 


0.115 
0.555 


0.290 
0.614 


0.584 
0.928 


0.756 
0.972 


0.913 
0.994 










r 


0.809 


0.562 


0.776 


0.930 


0.973 


0.995 




-1/2 





1 


0.931 


0.818 


0.989 


1.000 


1.000 


1.000 


(50, 2) 


-1/2 





-1/2 


l/n 
1 


1.000 
0.962 


0.186 
0.720 


0.704 
0.979 


0.990 
1.000 


1.000 
1.000 


1.000 
1.000 










r 


0.899 


0.876 


0.995 


1.000 


1.000 


1.000 





-1/2 





1 


0.953 


0.960 


1.000 


1.000 


1.000 


1.000 




-1/2 





l/n 


1.000 


0.420 


0.964 


1.000 


1.000 


1.000 


(100, 2) 


-1/2 

















1 


0.977 


0.924 


1.000 


1.000 


1.000 


1.000 










r 


0.922 


0.977 


1.000 


1.000 


1.000 


1.000 




-1/2 





1 


0.995 


1.000 


1.000 


1.000 


1.000 


1.000 




-1/2 





l/n 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


(500, 2) 


-1/2 



















1 


0.993 


1.000 


1.000 


1.000 


1.000 


1.000 










r 


0.999 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 5: The relative frequency of choosing the true model under the Bayes factor in (jl3p for the 
Pearson type VI prior with different values of the hyperparameters (a, k) for a fixed value of r 
and increasing values of p in the 10, 000 simulations. 
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r) 


a 


/3 


7 





0.5 


1 


2 


3 


5 






-1/2 





1 


0.734 


0.382 


0.461 


0.553 


0.613 


0.683 


(2, 


2) 


-1/2 





-1/2 


1/n 
1 


0.842 
0.799 


0.255 

n ono 

0.308 


0.320 

n o o 1 

0.381 


0.419 
0.478 


0.483 

n con 

0.539 


0.564 

n 1 n 

0.619 












r 


0.713 


0.407 


0.486 


0.578 


0.634 


0.702 






-1/2 





1 


0.938 


r\ on CT 

0.805 


n n r n 

0.959 


n nno 

0.993 


n nno 

0.998 


1 nnn 

1.000 


(10, 


5) 


-1/2 





-1/2 


1/n 
1 


0.998 
U.9dd 


0.376 
0.734 


0.746 

n no^^ 

0.936 


0.951 

n nno 

0.992 


0.985 

n nn*? 

0.997 


0.997 

1 nnn 

1.000 












r 


0.859 


0.890 


0.980 


0.997 


0.999 


1.000 






-1/2 





1 


0.945 


n '7n CT 

0.795 


n n 1 

0.916 


n n*? /I 

0.974 


n no o 

0.988 


n nn/^ 

0.996 


(5, 


10) 


-1/2 






1 /o 

-1/2 


1/n 

-1 
i 


0.999 
U.9d9 


0.485 
0. /46 


0.748 
0.856 


0.903 
0.965 


0.951 

0.984 


0.980 
0.993 












r 


0.844 


0.881 


0.956 


0.983 


0.994 


0.998 






-1/2 





1 


0.968 


n n c o 

0.952 


n nn /i 

0.994 


n nnn 

0.999 


1 nnn 

1.000 


1 nnn 

1.000 


(10, 


10) 


-1/2 





-1/2 


1/n 
1 


1.000 

r\ no /I 

0.984 


0.731 

n non 

0.930 


0.948 

n nm 

0.991 


0.994 

n nni 

0.991 


1.000 

1 nnn 

1.000 


1.000 

1 nnn 

1.000 












r 


0.872 


0.981 


0.998 


1.000 


1.000 


1.000 






-1/2 





1 


n nn'7 

0.997 


1 nnn 

1.000 


1 nnn 

1.000 


1 nnn 

1.000 


1 nnn 

1.000 


1 nnn 

1.000 


(25, 


50) 


-1/2 





-1/2 


1/n 
1 


1.000 
0.990 


1.000 
1.000 


1.000 
1.000 


1.000 
1.000 


1.000 
1.000 


1.000 
1.000 












r 


0.911 


1.000 


1.000 


1.000 


1.000 


1.000 






-1/2 





1 


0.996 


1.000 


1.000 


1.000 


1.000 


1.000 


(50, 


25) 


-1/2 





-1/2 


1/n 
1 


1.000 
0.998 


1.000 
1.000 


1.000 
1.000 


1.000 
1.000 


1.000 
1.000 


1.000 
1.000 












r 


0.933 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 6: The relative frequency of choosing the true model under the Bayes factor in (jl3p for the 
Pearson type VI prior with different values of the hyperparameters (a, /3, k) for increasing values 
of p and r in the 10, 000 simulations. 
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